11:10
2026-06-24
byteiota.com
ai-research
SWE-bench Pro: How to Read the Coding Agent Leaderboard
OpenAI abandoned SWE-bench Verified on February 23, 2026, after finding 59.4% of its hardest failed tests were broken and training data contamination inflated scores. Its replacement, SWE-bench Pro frโฆ